Mapping Quantitative Traits

Preliminaries

If you are not already familiar with the structure of these exercises, read the Introduction first.

If you have not already worked through the first part of this exercise: Quantitative Traits, begin with that first. That page has the background and information you need to fully understand this case study.

Note

Reminder: Save your work regularly.

Important

If you are using a Mac, we recommend that you use either Chrome or Firefox to complete these exercises. Some of the default settings in Safari prevent these exercises from running.

Contact information

If you have questions about these exercises, please contact Dr. Kevin Middleton (middletonk@missouri.edu) or drop by Tucker 224.

Learning objectives

The learning objectives for this exercise are:

  • Describe what quantitative trait loci (QTL) are and outline QTL are identified
  • Explain how the contributions of many genes of small effect can be associated with a disease or condition
  • Differentiate Mendelian traits from threshold traits
  • Compare Mendelian human diseases and diseases that result from threshold traits

Introduction

In the previous exercise (Quantitative Traits), we saw how we could build up a picture of quantitative, polygenic traits from your existing understanding of Mendelian traits. Furthermore, we saw how many many genes of small effect, each of which added or subtracted a small amount to a phenotype, can produce a continuous (normal) distribution of trait values.

To this point, we have only considered how genes contribute to a quantitative trait and how many genes might contribute to a traits. What we haven’t considered yet is how scientists estimate where in the genome the associated genes are located.

Genetic Variation

Ultimately, different phenotypes – both discrete qualitative phenotypes like blood types and quantitative like heights – result from genetic variation. Many different processes lead to variation, including mutation, drift, and selection among others.

Single Nucleotide Polymorphisms

Although many methods can be used to determine the locations of traits on the genome – “mapping” – one of the most common methods in the genomic era is via single nucleotide polymorphisms (SNPs). As their name suggests, SNPs are alternate nucleotides (e.g., an A or a T) at a single location in the genome. SNPs can occur in both coding and non-coding regions of the genome (Figure 1).

Because most of the genome is identical within a species, SNPs represent a relatively small percentage of the whole genome. For example, in humans, the entire genome consists of over 6 billion base pairs, but a recent study only used 2.3 million SNPs (Yengo et al. 2018). While 2.3 million may seem like a very large number, that represents only 0.04% of the genome.

Figure 1: Single nucleotide polymorphisms are locations in the genome where two alternate nucleotides are observed. In the upper panel, two SNPs are shows, one where three individuals have a A and one with G. In the second SNP, a different combination of three individuals have G and one T. Groups of SNPs that are physically located near one another are grouped into haplotypes (lower panel). Image from Wellcome Sanger Institute.

Associating SNPs with traits

Shapiro pigeon example (dominant trait)

Figure 2: In The Variation of Animals and Plants Under Domestication, Charles Darwin described the array of feather phenotype in domesticated pigeons (Darwin 1868a,b). Image from Memorial University.
Figure 3: FIXME Image from (Shapiro et al. 2013).
Figure 4: FIXME Image from (Shapiro et al. 2013).
Figure 5: R = Arginine; C = Cysteine FIXME Image from (Shapiro et al. 2013).

Human Mendelian diseases are “easy” to identify

Figure 6: FIXME Image from (Visscher et al. 2012).
Figure 7: FIXME Image from (Visscher et al. 2017).

Associating QTLs with genetic variants

Case study: Investigating a newly discovered muscle mutation in mice

Figure 8

Figure 8: The mass of the calf muscles plotted against body mass compared between Unaffected (red) and Mini-muscle phenotype mice (blue). But you do not expect that some mice would have such disproportionately small muscles compared to the others.
Figure 9: Distributions of the the masses of the calf muscles compared between Unaffected (red) and Mini-muscle phenotype mice (blue).
Figure 10: Comparison of the the masses of the calf muscles compared between Unaffected (red) and Mini-muscle phenotype mice (blue). The large points are the means for each group.

GWAS

GWAS Catalog

Primer (Uffelmann et al. 2021)

QTL for Human Height

Estimated to be ~700 explaining ~16% of variation in 2010 (Lango Allen et al. 2010)

  • Best understood quantitative trait in humans

  • Yet still 700 genes

  • Largest GWAS to date involves ~700,000 individuals described by Yengo et al. (2018)

  • 3,290 (“near-independent”) SNPs explain ~25% of the phenotypic variation in human height among a sample of Europeans

Case Study: Threshold traits

Alzheimer (Pedersen et al. 2001)

Cardiac conditions (Walsh et al. 2020)

ASD (Grove et al. 2019)

Summary, Complex disease traits (Pal et al. 2015; Huang 2015)

Schizophrenia (~200 genes)

(Guttmacher et al. 2004; Gillis 2006; Muhrer 2014)

Why family history is one of the most important diagnostic tools in medicine

(Guttmacher et al. 2004; Gillis 2006; Muhrer 2014)

Feedback

FIXME

References

Darwin, C. 1868a. The Variation of Animals and Plants Under Domestication, Vol. 1. John Murray, London.
Darwin, C. 1868b. The Variation of Animals and Plants Under Domestication, Vol. 2. John Murray, London.
Gillis, J. 2006. The History of the Patient History Since 1850. Bull. Hist. Med. 80:490–512. JSTOR.
Grove, J., S. Ripke, T. D. Als, M. Mattheisen, R. K. Walters, H. Won, J. Pallesen, E. Agerbo, O. A. Andreassen, R. Anney, S. Awashti, R. Belliveau, F. Bettella, J. D. Buxbaum, J. Bybjerg-Grauholm, M. Bækvad-Hansen, F. Cerrato, K. Chambert, J. H. Christensen, C. Churchhouse, K. Dellenvall, D. Demontis, S. De Rubeis, B. Devlin, S. Djurovic, A. L. Dumont, J. I. Goldstein, C. S. Hansen, M. E. Hauberg, M. V. Hollegaard, S. Hope, D. P. Howrigan, H. Huang, C. M. Hultman, L. Klei, J. Maller, J. Martin, A. R. Martin, J. L. Moran, M. Nyegaard, T. Nærland, D. S. Palmer, A. Palotie, C. B. Pedersen, M. G. Pedersen, T. dPoterba, J. B. Poulsen, B. S. Pourcain, P. Qvist, K. Rehnström, A. Reichenberg, J. Reichert, E. B. Robinson, K. Roeder, P. Roussos, E. Saemundsen, S. Sandin, F. K. Satterstrom, G. Davey Smith, H. Stefansson, S. Steinberg, C. R. Stevens, P. F. Sullivan, P. Turley, G. B. Walters, X. Xu, Autism Spectrum Disorder Working Group of the Psychiatric Genomics Consortium, BUPGEN, Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium, 23andMe Research Team, K. Stefansson, D. H. Geschwind, M. Nordentoft, D. M. Hougaard, T. Werge, O. Mors, P. B. Mortensen, B. M. Neale, M. J. Daly, and A. D. Børglum. 2019. Identification of Common Genetic Risk Variants for Autism Spectrum Disorder. Nat. Genet. 51:431–444.
Guttmacher, A. E., F. S. Collins, and R. H. Carmona. 2004. The Family History–More Important Than Ever. N. Engl. J. Med. 351:2333–2336. Massachusetts Medical Society.
Huang, Q. 2015. Genetic Study of Complex Diseases in the post-GWAS Era. J. Genet. Genomics 42:87–98. Elsevier.
Lango Allen, H., K. Estrada, G. Lettre, S. I. Berndt, M. N. Weedon, F. Rivadeneira, C. J. Willer, A. U. Jackson, S. Vedantam, S. Raychaudhuri, T. Ferreira, A. R. Wood, R. J. Weyant, A. V. Segrè, E. K. Speliotes, E. Wheeler, N. Soranzo, J.-H. Park, J. Yang, D. Gudbjartsson, N. L. Heard-Costa, J. C. Randall, L. Qi, A. Vernon Smith, R. Mägi, T. Pastinen, L. Liang, I. M. Heid, J. Luan, G. Thorleifsson, T. W. Winkler, M. E. Goddard, K. Sin Lo, C. Palmer, T. Workalemahu, Y. S. Aulchenko, A. Johansson, M. C. Zillikens, M. F. Feitosa, T. Esko, T. Johnson, S. Ketkar, P. Kraft, M. Mangino, I. Prokopenko, D. Absher, E. Albrecht, F. Ernst, N. L. Glazer, C. Hayward, J.-J. Hottenga, K. B. Jacobs, J. W. Knowles, Z. Kutalik, K. L. Monda, O. Polasek, M. Preuss, N. W. Rayner, N. R. Robertson, V. Steinthorsdottir, J. P. Tyrer, B. F. Voight, F. Wiklund, J. Xu, J. H. Zhao, D. R. Nyholt, N. Pellikka, M. Perola, J. R. B. Perry, I. Surakka, M.-L. Tammesoo, E. L. Altmaier, N. Amin, T. Aspelund, T. Bhangale, G. Boucher, D. I. Chasman, C. Chen, L. Coin, M. N. Cooper, A. L. Dixon, Q. Gibson, E. Grundberg, K. Hao, M. Juhani Junttila, L. M. Kaplan, J. Kettunen, I. R. König, T. Kwan, R. W. Lawrence, D. F. Levinson, M. Lorentzon, B. McKnight, A. P. Morris, M. Müller, J. Suh Ngwa, S. Purcell, S. Rafelt, R. M. Salem, E. Salvi, S. Sanna, J. Shi, U. Sovio, J. R. Thompson, M. C. Turchin, L. Vandenput, D. J. Verlaan, V. Vitart, C. C. White, A. Ziegler, P. Almgren, A. J. Balmforth, H. Campbell, L. Citterio, A. De Grandi, A. Dominiczak, J. Duan, P. Elliott, R. Elosua, J. G. Eriksson, N. B. Freimer, E. J. C. Geus, N. Glorioso, S. Haiqing, A.-L. Hartikainen, A. S. Havulinna, A. A. Hicks, J. Hui, W. Igl, T. Illig, A. Jula, E. Kajantie, T. O. Kilpeläinen, M. Koiranen, I. Kolcic, S. Koskinen, P. Kovacs, J. Laitinen, J. Liu, M.-L. Lokki, A. Marusic, A. Maschio, T. Meitinger, A. Mulas, G. Paré, A. N. Parker, J. F. Peden, A. Petersmann, I. Pichler, K. H. Pietiläinen, A. Pouta, M. Ridderstråle, J. I. Rotter, J. G. Sambrook, A. R. Sanders, C. O. Schmidt, J. Sinisalo, J. H. Smit, H. M. Stringham, G. Bragi Walters, E. Widen, S. H. Wild, G. Willemsen, L. Zagato, L. Zgaga, P. Zitting, H. Alavere, M. Farrall, W. L. McArdle, M. Nelis, M. J. Peters, S. Ripatti, J. B. J. van Meurs, K. K. Aben, K. G. Ardlie, J. S. Beckmann, J. P. Beilby, R. N. Bergman, S. Bergmann, F. S. Collins, D. Cusi, M. den Heijer, G. Eiriksdottir, P. V. Gejman, A. S. Hall, A. Hamsten, H. V. Huikuri, C. Iribarren, M. Kähönen, J. Kaprio, S. Kathiresan, L. Kiemeney, T. Kocher, L. J. Launer, T. Lehtimäki, O. Melander, T. H. Mosley Jr, A. W. Musk, M. S. Nieminen, C. J. O’Donnell, C. Ohlsson, B. Oostra, L. J. Palmer, O. Raitakari, P. M. Ridker, J. D. Rioux, A. Rissanen, C. Rivolta, H. Schunkert, A. R. Shuldiner, D. S. Siscovick, M. Stumvoll, A. Tönjes, J. Tuomilehto, G.-J. van Ommen, J. Viikari, A. C. Heath, N. G. Martin, G. W. Montgomery, M. A. Province, M. Kayser, A. M. Arnold, L. D. Atwood, E. Boerwinkle, S. J. Chanock, P. Deloukas, C. Gieger, H. Grönberg, P. Hall, A. T. Hattersley, C. Hengstenberg, W. Hoffman, G. M. Lathrop, V. Salomaa, S. Schreiber, M. Uda, D. Waterworth, A. F. Wright, T. L. Assimes, I. Barroso, A. Hofman, K. L. Mohlke, D. I. Boomsma, M. J. Caulfield, L. A. Cupples, J. Erdmann, C. S. Fox, V. Gudnason, U. Gyllensten, T. B. Harris, R. B. Hayes, M.-R. Jarvelin, V. Mooser, P. B. Munroe, W. H. Ouwehand, B. W. Penninx, P. P. Pramstaller, T. Quertermous, I. Rudan, N. J. Samani, T. D. Spector, H. Völzke, H. Watkins, J. F. Wilson, L. C. Groop, T. Haritunians, F. B. Hu, R. C. Kaplan, A. Metspalu, K. E. North, D. Schlessinger, N. J. Wareham, D. J. Hunter, J. R. O’Connell, D. P. Strachan, H.-E. Wichmann, I. B. Borecki, C. M. van Duijn, E. E. Schadt, U. Thorsteinsdottir, L. Peltonen, A. G. Uitterlinden, P. M. Visscher, N. Chatterjee, R. J. F. Loos, M. Boehnke, M. I. McCarthy, E. Ingelsson, C. M. Lindgren, G. R. Abecasis, K. Stefansson, T. M. Frayling, and J. N. Hirschhorn. 2010. Hundreds of Variants Clustered in Genomic Loci and Biological Pathways Affect Human Height. Nature 467:832–838.
Muhrer, J. C. 2014. The Importance of the History and Physical in Diagnosis. Nurse Pract. 39:30–5; quiz 36. Ovid Technologies (Wolters Kluwer Health).
Pal, L. R., C.-H. Yu, S. M. Mount, and J. Moult. 2015. Insights from GWAS: Emerging Landscape of Mechanisms Underlying Complex Trait Disease. BMC Genomics 16 Suppl 8:S4. bmcgenomics.biomedcentral.com.
Pedersen, N. L., S. F. Posner, and M. Gatz. 2001. Multiple‐threshold Models for Genetic Influences on Age of Onset for Alzheimer Disease: Findings in Swedish Twins. Am. J. Med. Genet. 105:724–728. Wiley.
Shapiro, M. D., Z. Kronenberg, C. Li, E. T. Domyan, H. Pan, M. Campbell, H. Tan, C. D. Huff, H. Hu, A. I. Vickrey, S. C. A. Nielsen, S. A. Stringham, H. Hu, E. Willerslev, M. T. P. Gilbert, M. Yandell, G. Zhang, and J. Wang. 2013. Genomic Diversity and Evolution of the Head Crest in the Rock Pigeon. Science 339:1063–1067.
Uffelmann, E., Q. Q. Huang, N. S. Munung, J. de Vries, Y. Okada, A. R. Martin, H. C. Martin, and T. Lappalainen. 2021. Genome-Wide Association Studies. Nature Reviews Methods Primers 1:1–21. Nature Publishing Group.
Visscher, P. M., M. A. Brown, M. I. McCarthy, and J. Yang. 2012. Five Years of GWAS Discovery. Am. J. Hum. Genet. 90:7–24. cell.com.
Visscher, P. M., N. R. Wray, Q. Zhang, P. Sklar, M. I. McCarthy, M. A. Brown, and J. Yang. 2017. 10 Years of GWAS Discovery: Biology, Function, and Translation. Am. J. Hum. Genet. 101:5–22.
Walsh, R., R. Tadros, and C. R. Bezzina. 2020. When Genetic Burden Reaches Threshold. Eur. Heart J. 41:3849–3855.
Yengo, L., J. Sidorenko, K. E. Kemper, Z. Zheng, A. R. Wood, M. N. Weedon, T. M. Frayling, J. Hirschhorn, J. Yang, P. M. Visscher, and GIANT Consortium. 2018. Meta-Analysis of Genome-Wide Association Studies for Height and Body Mass Index in ∼700000 Individuals of European Ancestry. Hum. Mol. Genet. 27:3641–3649. Oxford University Press (OUP).